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ABSTRACT 

We present a proof-of-concept analysis of photometric redshifts with Bayesian priors on physi¬ 
cal properties of galaxies. This concept is particularly suited for upcoming/on-going large imaging 
surveys, in which only several broad-band filters are available and it is hard to break some of the 
degeneracies in the multi-color space. We construct model templates of galaxies using a stellar pop¬ 
ulation synthesis code and apply Bayesian priors on physical properties such as stellar mass and star 
formation rate. These priors are a function of redshift and they effectively evolve the templates with 
time in an observationally motivated way. We demonstrate that the priors help reduce the degener¬ 
acy and deliver significantly improved photometric redshifts. Furthermore, we show that a template 
error function, which corrects for systematic flux errors in the model templates as a function of rest- 
frame wavelength, delivers further improvements. One great advantage of our technique is that we 
simultaneously measure redshifts and physical properties of galaxies in a fully self-consistent manner, 
unlike the two-step measurements with different templates often performed in the literature. One may 
rightly worry that the physical priors bias the inferred galaxy properties, but we show that the bias 
is smaller than systematic uncertainties inherent in physical properties inferred from the SED fitting 
and hence is not a major issue. We will extensively test and tune the priors in the on-going Hyper 
Suprime-Cam survey and will make the code publicly available in the future. 

Subject headings: surveys — galaxies: distances and redshifts — galaxies: statistics 


1. INTRODUCTION 

Galaxies form at high density peaks of statistical den¬ 
sity fluctuations in the Universe. They are thus statis¬ 
tical objects in nature, and a large survey of the sky is 
the key observation to measure their physical properties. 
This has m otivated recent su rveys such as Sloan Digital 
Sky Sur vey (lYork et aO2000fl . U KIRT Deep Infrared Sky 
Survey ([Lawrence et al.l I2007T ) , Canada-France-Hawaii 
Telescope Legacy Survey, and many others. Following 
them, there are a number of on-going/planned surveys, 
including the Hyper Suprime-Cam Survey (Miyazaki et 
al. in prep.), VST and VISTA public surveys, the Dark 
Energy Survey (|The Dark Energy Survey Collaboration! 
I2005f h Euclic0, WFIRST (iSnergel et a,l.l20 lTH'i , a nd Large 
Synoptic Survey Telescope (llvezic et al.l 120081) . These 
surveys are going to observe a large fraction of the sky 
down to unprecedented depths in order to address some 
of the outstanding astrophysical questions today, such 
as the nature of dark energy, with a superb statistical 
accuracy. 

Most of these large surveys are imaging surveys. The 
information we can obtain directly from imaging data 
is unfortunately very limited; positions, apparent fluxes, 
apparent sizes, and shapes in a given set of filters, and 
their time variability if multi-epoch data are available. In 
order to study the nature of distant objects, we need to 
translate these apparent quantities into physical quan¬ 
tities such as luminosities and physical sizes. Distance 
information is required for this job. Also, weak-lensing 
cosmology, which is one of the major science goals of 
the large surveys, requires redshifts (or at least a mean 
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redshift) of source galaxies. A precise redshift can be 
measured from a spectroscopic observation. It is, how¬ 
ever, practically not possible to measure precise redshifts 
even for 1% of objects from these surveys with any of the 
existing/near-future facilities. Furthermore, most of the 
objects will be fainter than the spectroscopic sensitivity 
limits. There are currently two techniques to overcome 
these problems: photometric redshift and clustering red¬ 
shift. 

Photometric redshift is a technique to infer redshifts of 
objects from pho t ometr y in a set of filters, first demon¬ 
strated bv iBauml ( 1962 ). Thanks to the progress in nu¬ 
merical techniques, it now has two branches: spectral 
energy distribution (SED) fitting and machine-learning. 
The first branch is the traditional technique and it relies 
on a priori knowledge of SEDs of galaxies. Extens i ve dis¬ 
cussio ns on the technique can be found in Walcher et al. 
(2011 b Our new code presented here falls in this cat¬ 
egory and we will elaborate on the (dis) advantages of 
the SED fitting later. A number of photo-z codes have 
been p ublished and t he most pop ular ones would includ e 
BPZ (iBenfted l2000f). ZE BRA (iFeldmann et al.l 1200611 . 
LePha re (lArnouts et all 119991 : lllbert et al.l 120061) . and 
EAZY (jBrammer et al.1 120081 1 . The second branch is 
purely numerical (i.e., no physics) and the idea is to re¬ 
late observables such as colors to redshifts in one way 
or another using a training set, which has to repre¬ 
sent an input photometric sample. A number of nu¬ 
merical tech niques have been app l ied such as polyno - 
mial fitting (jConnollv et, all 11 9951: iHsieh V Ye ef I20l4l . 
artificial neural network d Collister fc bahaT 2004fl and 
prediction trees dCarrasco Kind fe Brunner] 120131 b Ex¬ 
tensive comparisons between some of these public SED- 
based/ machine-learning photo-z codes have been carried 
out bv iHildebrandt et all (l2010f) . 
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Clustering redshift utilizes the positional information, 
not flux information, of objects. The idea is sim¬ 
ple — cross-correlation between two galaxy samples 
yields a signal where they overlap in redshift and the 
clustering signal can thus be used as a redshift infer¬ 
ence. Several authors have explor ed this technique with 
slightly differen t objectives 1 e.g.. iSchneider et all 120061 
Newmanl 120081: lErben et al.l 120091: iMcQuinn fcW hitc 
20131: IMenard et al.l 1 201 3l) ~ One powerful application 
of this technique is to use a sample of spectroscopic 
redshifts, in which the redshift distribution is precisely 
known, as a reference. If one makes a narrow redshift 
bin with a spectroscopic sample and cross-correlate it 
with a photometric sample, the clustering signal is pro¬ 
portional to the number of photometric galaxies in that 
redshift bin. By shifting the spectroscopic redshift bin, 
one can in principle reconstruct a r edsh ift distri b ution 
of the input photometric sample (see IMenard et al.ll2013i 
for application of this technique for a few specific cases). 
However, there is one uncertainty here; the clustering 
signal is also proportional to the bias of the photometric 
sample, which is not s traightforward to measure a priori 
(jde Putter et al.ll2014H . This can be a serious issue when 
the photometric sample covers a wide range of redshift 
or has multiple peaks in the redshift distribution, which 
may often be the case in real analysis. 

There are pros and cons in these techniques and one 
should choose one that is best suited for his/her science 
application. We focus on the traditional SED fitting 
technique in this paper for two reasons: (1) machine¬ 
learning techniques require a representative training 
sample, which in practice means they do not go fainter 
than the spectroscopic limits and (2) the clustering tech¬ 
nique does not give redshifts of individual galaxies (it 
gives dN/dz of the input photometric sample) 0- The 
1st point is a serious limitation of the machine-learning 
techniques because objects that we are concerned about 
are often fainter than the spectroscopic limit. One can of 
course extrapolate to faint magnitudes, but an extrapo¬ 
lation always requires a great care. With the SED fitting 
technique, we can expect to go fainter than the spectro¬ 
scopic limit provided that our understanding of SEDs 
of galaxies is reasonable. The 2nd point is also impor¬ 
tant because, if one cannot infer redshifts of individual 
galaxies, one cannot estimate their physical properties, 
making galaxy science difficult. 

The SED fitting technique has its own problem; it relies 
on our a priori knowledge of SEDs of galaxies. Observed 
SEDs of galaxies are often used as templates, but these 
templates are almost exclusively collected at z = 0. The 
multi-color space at high redshifts may not be fully cov¬ 
ered by those templates because high redshift galaxies 
are younger than galaxies at z = 0. One can instead 
use a stellar population synthesis (SPS) code to remedy 
the issue with its flexibility to generate SEDs of a given 
age. However, SPS models do not perfectly reproduce ob- 
serve d colors of galaxies due to systematics in the models 
(e.g., iMaraston et al.l[2009T) . Furthermore, it is easy to 

3 In principle, one can make a dense grid on the color-magnitude 
space and apply the clustering technique to construct P(z ) for each 
grid. But, it will require a huge number of spectroscopic redshifts 
and is unlikely a practical method at this point. It will not work 
for rare objects either. 


generate models that are physically unreasonable such 
as a template with zero SFR with a large amount of ex¬ 
tinction. Such templates increase the degeneracy in the 
multi-color space, resulting in poor photo-z accuracies. 

In this paper, we choose to use SPS models with a 
novel approach to overcome these problems; (a) we ap¬ 
ply Bayesian priors on physical properties of galaxies to 
constrain the parameter space of SPS models within re¬ 
alistic ranges as a function of redshift and (b) correct 
for systematic flux errors using template error functions 
(which are an extended version of template error func¬ 
tions often adopted in the literature). This technique 
is still in its early phase of development and the paper 
presents a proof-of-concept analysis of the physical priors 
and template error functions. 

The layout of the paper is as follows. We give a brief 
overview of the paper in Section 2, followed by a descrip¬ 
tion of how we generate model templates in Section 3. 
Section 4 presents the framework of our technique and 
describes the physical priors in detail. We define a tem¬ 
plate error function in Section 5 and then move on to 
demonstrate how these priors and template error func¬ 
tions improve photo-z’s in Section 6. We compare our 
code with BPZ used for CFHTLenS in Section 7. Sec¬ 
tion 8 aims to characterize biases in inferred physical 
properties of galaxies introduced by the priors and tem¬ 
plate error functions. Finally, we discuss future direc¬ 
tions in Section 9 and summarize the paper in Section 
10. Unless otherwise stated, we adopt a flat universe 
with H n = 70 km s -1 Mpc -1 , Dm = 0.27 and Da = 0.73 
(jKomatsu et al.l 120111) . As one of the major goals of 
upcoming/on-going surveys is weak-lensing cosmology, 
it is important to emphasize that our photo-z’s are not 
dependent on a specific choice of cosmology. We simply 
need to assume a popular cosmology often adopted in 
the literature in order to use the observed relationships 
between galaxy properties as priors (see Section 4 for 
details). 

2. OVERVIEW OF OUR PHOTO-Z TECHNIQUE 
AND THE SCOPE OF THE PAPER 

First of all, we give a brief overview of our technique 
in this section. Thanks to the enormous amount of work 
on galaxy populations over a wide range of redshift, we 
now have a reasonable understanding of galaxy prop¬ 
erties across cosmic times. We apply Bayesian priors 
based on our knowledge of galaxy properties to SPS mod¬ 
els, in which physical properties such as stellar mass are 
known, to constrain the model parameters within real¬ 
istic ranges. We let the priors evolve with redshift in 
order to account for evolutionary effects. A fixed set of 
templates are often used in photo-z estimates, but the 
priors effectively evolve the templates in an observation- 
ally motivated manner. 

Our approach here is suited for upcoming/on-going 
large imaging surveys such as HSC, DES and LSST, 
which image the sky only in a limited number of filters 
in the optical wavelengths. If one has photometry in a 
large number of filters spanning a wide wavelength range 
with sufficient S/N, which may often be the case in deep 
fields, one does not necessarily have to apply priors be¬ 
cause there is enough information in the data to constr ain 
t he ov erall spectral shape of objects. In fact. lllbert et all 
(|2009f ) did not apply any priors because they had 30-band 


























Mizuki 


3 


photometry. However, with the limited photometric in¬ 
formation in the large surveys, it is difficult to break 
some of the degeneracies in the multi-color space, result¬ 
ing in poor photo -2 accuracies. We demonstrate below 
that the priors developed here are particularly useful to 
break (or least weaken) the degeneracies in such situa¬ 
tions and they significantly improve photo- 2 's. 

In addition to the physical priors, we introduce a tem¬ 
plate error function, which comes in two terms; system¬ 
atic flux stretch and flux uncertainty as a function of 
rest-frame wavelength. The first term is to reduce mis¬ 
matches between SPS SEDs and observed SEDs and the 
2nd term assigns uncertainties to the model templates 
to properly weight reliable parts of templates. The tem¬ 
plate erro r function is not a new idea and h as been used 
by E AZY (iBrammer et al.1120081 ) and fast ()Kriek et, al.1 
2009). But, they include only the 2nd term. We intro¬ 
duce the 1st term in order to reduce systematic offsets in 
| z spec — Zphot | and improve the overall photo -2 accuracy 
as we demonstrate in Section 6. 

One of the strengths of our code is that it infers, in 
addition to redshifts, physical properties of galaxies in 
a self-consistent manner. A popular procedure to infer 
the physical properties is to first estimate photo- 2 ’s with 
empirical or PC A templates, and then change templates 
to SPS models to infer physical properties. SPS models 
are usually not used to infer photo- 2 ! s because they de¬ 
liver poor accuracy. But, as we demonstrate in Section 
6, the physical priors and template error functions im¬ 
prove SPS-based photo- 2 ’s and we do not need take this 
2-step approach anymore. We can infer both redshifts 
and physical properties simultaneously using the same 
set of templates. This is important because redshift and 
SED shapes are partially degenerate and thus they have 
to be measured in a self-consistent manner. With our 
technique, we can properly propagate uncertainties in 
redshift into uncertainties in physical properties. 

Our code is called mizuki . A preliminary version 
of the code was u sed in I Tanaka et all d2013allbl ') and 
iSantos et al.1 (|2014l h This paper gives a full description 
of the code as it stands today. It is still in an early phase 
of development and one of the aims of the paper is to 
identify areas where further work is needed. We focus on 
galaxies in this paper, but our goal is to compute rela¬ 
tive probabilities that an object is star, AGN/QSO, or 
galaxy with corresponding P{z) for each clasfl Tests 
on AGNs and stars are being carried out and we defer 
detailed discussions to our future paper. 

3. STELLAR POPULATION SYNTHESIS 
TEMPLATES 

We use spectral temp l ates o f galaxies generated with 
the IBruzual fe Chariot! (|2003h code. To the first or¬ 
der, a specific choice of SPS code is not a major con¬ 
cern because a template error function described in Sec¬ 
tion 5 will reduce systematic differences between the 
model and observed SEDs of galaxies. In fact, we have 
confirmed that an updated version of the code, which 
incorporates a revised treatment of thermally pulsat¬ 
ing AGB stars, deliver similar photo-z accuracies. We 
choose to use the older version because there is a grow- 

4 P(z) for star is simply Piz) = Pstarfi(z), where S is the Dirac 
delta function. 


ing body of evidence for thermally pulsating AGB stars 
being unimport ant in integrated optical-nearIR spectra 
of galaxies (e.g..lConrov fe Gunnll20lQ : iKriek et~aHl 20101 : 

iZibetti et, al.ll2013l i. although these stars may be hidden 
by dust and may be an important contributor to thermal 
emission in the IR. 

One needs to make several assumptions to generate 
SEDs with an SPS code. These assumptions include, ini¬ 
tial mass function (IMF), attenuation curve, star forma¬ 
tion history, and metallicity distribution. The first two 
are often assumed to be time-invariant. None of them is 
known a priori for a given galaxy, and the assumptions 
employed inevitably introduce systematic uncertainties 
in the physical parameters such as stellar mass inferred 
from SED fitting. We choose to employ popular as¬ 
sumptions often adopted in the literature so that we can 
use the published results on galaxy properties as priors 
(see the next section fo r deta ils). To be specific, we as¬ 
sume the C habrier IMF (IChabriedl2003f) . Calzetti attenu¬ 
ation curve dCalzetti et al.ll2000D . exponentially decaying 
star formation rates (i.e., r-model), and solar metallicity. 
IBruzual & Chariot (2003) models do not include nebular 
emissi on lines and s o we add them using the intensity ra¬ 
tios in llnouel (|2011h with the iCalzettil (1997) differential 
attenuation law. The Lyman a escape fraction is as¬ 
sumed to be 10% at all redshifts. Thermal dust emission 
is yet to be included in the models, but it does not affect 
results presented in this paper. Absorption du e to the 
Lyman alpha forest is applied following iMadau ( 199 51. 

Let us briefly discuss the validity of some of these as¬ 
sumptions. A realistic spectral model should consider a 
metallicity distribution, but we assume that all the stars 
ha ve so l ar me tallicity as often assumed in the literature. 

IConrov et al.l ( 2009 ) showed that broad-band evolution 
of multi-metallicity galaxy SEDs is equivalent to that of 
single mean metallicity population in the redward of the 
U-band. We should keep in mind that we suffer from 
multi-metallicity effects, in addition to the evolutionary 
effects, at high redshifts where we often observe rest- 
frame UV of galaxies. The biggest uncertainty in our 
models is perhaps star formation histories. It is a funda¬ 
mentally difficult problem to reconstruct star formation 
histories from broad-band photometry due to the fact 
that a small amount of recent star formation can domi¬ 
nate the over all SED, hi ding li ght from old stellar pop¬ 
ulations (jMaraston et al.11201 0). The only way to solve 
the problem might be to resolve galaxies into individ¬ 
ual stars, but that is not a practical solution for galax¬ 
ies at cosmological distances. Given this fundamental 
difficulty, we prefer to be simple and take the popular 
assumption of r-models. 

We generate SPS templates for ages between 0.05 and 
14 Gyr with a logarithmic grid of ~ 0.05 dex. For r, 
in addition to the single stellar population model (i.e., 
r = 0) and constant SFR model (i.e., r = oo), we assume 
0.1 Gyr < r < 11 Gyr with a logarithmic grid of 0.2 dex. 
Optical depth in the U-band goes between 0 and 2 with 
a step of 0.1 with an addition of ry = 2.5, 3, 4, and 5 
models to cover very dusty sources. These templates are 
then redshifted to 0 < 2 < 7 with A 2 = 0.01 and are 
convolved with response functions of a given instrument 
to generate a library of synthetic fluxes. A synthetic 
broad-band flux is computed as 
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J dvS u f u /v 
< / di/5,,/i/ 


(1) 


where 5), is the response function of an instrument 
(which includes the atmospheric transmission for ground- 
based facilities), and f v is the input spectrum. In total, 
we have about 2 million templates. 


4. PHYSICAL PRIORS 


We now describe the physical priors. We begin with 
the Bayesian framework and then move on to describe 
each of the physical priors. 

4.1. Framework 


We follow the Bayesian framework to constrain the pa¬ 
rameter space of the SPS models. Let to be an array 
of input photometric information in a given set of fil¬ 
ters. We aim to infer photometric redshift ( z p hot ) from 
to. Physical properties of an input SPS model can be 
specified by 


G = G(Z, t v , r, age), (2) 

where parameters on the right are metallicity, optical 
depth in the R-band (i.e., amount of attenuation), e- 
folding time scale of star formation rate, and age of the 
model. At this point, a model is normalized such that 
initial SFR is 1 M 0 yr . Let us introduce a normal¬ 
ization factor, a, in order to be able to compute stellar 
mass and SFR, which scale linearly with the normaliza¬ 
tion. Stellar mass and SFR are important because they 
can be measured more precisely in observations than the 
other parameters and we will extensively use them in 
what follows. A posterior probability of finding z and G 
from a given to is 

P(z,G\rh) <x j daP(rh\z,G,a)P(z,G,a). (3) 

The prior on redshift and physical properties of galax¬ 
ies, P(z,G,a), needs some consideration. Ideally, we 
would like to apply the prior P(z, G , a) on all the param¬ 
eters simultaneously, but we do not have sufficient ob¬ 
servational constraints on the distributions of the phys¬ 
ical parameters and their redshift dependence. Ob- 
servationally available constraints are often correlations 
between some limited combinations of the parameters. 
One instead could use semi-analytic models to construct 
P(z, G , a), but semi-analytic models do not perfectly re¬ 
produce galaxy properties in the real universe and the 
priors may well be biased. 

What this prior effectively does is to constrain the 
physical parameters of the SPS models within realistic 
ranges. This motivates us to interpret the prior broadly 
and consider constraining the parameter space by com¬ 
bining observed relationships between physical proper¬ 
ties of galaxies available in the literature (which are ba¬ 
sically in form of marginalized probability distribution 
functions). We assume that the prior can be factorized 
by 


This set of priors is not fully equivalent to P(z,G,a ), 
but we are limited by the available observational in¬ 
formation on these physical parameters. As mentioned 
above, metallicity is fixed to solar. But, thanks to the 
strong dust-age-metallicity degeneracy, the multi-color 
space is covered by the dust and age variations. Metallic- 
ity is diffic ult to infer from broad-band photometry (e.g., 
iPforr et al.ll2012f) and it is unlikely a useful prior in any 
case. Because of the experimental aspect of the priors, 
we regard the analysis in this paper as a proof-of-concept 
analysis rather than fully contained work of physical pri¬ 
ors. We plan to further test and explore physical priors 
in the future. 

The likelihood P(m\z,G,a) is computed as 

D ,-.\ A n ( X 2 (z,G,a)\ 

P{m\z,G,a) oc exp I--- I, (5) 


X 2 (z,G,a) = Y 

i 


( fi,obs , model (rG )) 2 

v(fi,obs ) 2 


( 6 ) 


where fi t0 b s and ft,model are observed and model fluxes 
in the i-th filter and cr(/i.obs) is the observed flux uncer¬ 
tainty in that filter. We note that we use linear fluxes 
in the above equations, not logarithmic magnitudes. We 
will later introduce a template error function, which as¬ 
signs uncertainties to model fluxes. The above equations 
can be easily extended to add the model uncertainties to 
the observed flux uncertainties in the quadrature. 

Using Eqs. 4-6, we can now work out Eq. 3. However, 
the marginalization over a turns out to be a computa¬ 
tionally expensive task. We find that, in our cases dis¬ 
cussed in the following sections, P(rh\z,G,a) is sharply 
peaked around a that minimizes y 2 defined in Eq. 6, 
which can be computed by 



This can be explicitly written as 


(7) 


&best 


E fi,obsfi,model 
i <?(fi,obs )' 2 


Er 


KIT, 


odel 


( 8 ) 


Motivated by the sharp probability distribution function 
(PDF) around ab es t , we have compared photo-Rs com¬ 
puted with the marginalization over a and those com¬ 
puted with a = abest without marginalization. It turns 
out that they are almost identical: er(A z p hot) ~ 0.001 
with no systematic offset between the two runs. This 
level of scatter is much smaller than those we discuss later 
and it does not affect our results at all. Very faint objects 
(e.g., those with photometric uncertainties larger than 
0.5 mag in all filters) may have a significant P(m\z, G, a) 
over a wide range of a, but photo -2 uncertainties for such 
faint objects are large in any case and the extra uncer¬ 
tainty coming from assuming a = abest will be negligible. 
For these reasons, we assume 


P(z, G, a) = P{z)P{SFR\M *, z)P(t v \SFR, z ) 

P(age\M tl z). (4) 


P(z, G|m) oc 


J daP(m\z,G,a)P(z,G,a) 
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^ P[jn\z, Gy CXbest )r*(~, G, CXbest ) - (9) 

A photo -2 PDF can then be obtained by marginalizing 
over the model parameters: 

P(z|ro) oc ^ P(z, G|to). (10) 

G 

Our choice of the model grids are described in the pre¬ 
vious section. A PDF is then normalized such that the 
probability integrated over redshift equals unity. In a 
similar way, we can compute PDFs for physical prop¬ 
erties (using ctbest for SFR and stellar mass) with all 
the other parameters marginalized over. We shall em¬ 
phasize that redshift is also marginalized over and the 
uncertainty in redshift is properly included in PDFs of 
physical properties. These PDFs will be very useful for 
galaxy studies. 

In the following, we first define the N(z) prior and then 
move on to describe the physical priors used to constrain 
the parameter space of the SPS models. One may rightly 
worry that the physical priors bias the inferred physical 
properties. We will quantify biases introduced by the 
priors in Section 8 and show that they are small. 


4.2. N(z) 

A redshift distribution of galaxies, N(z), is often used 
as a prior for photo-z’s in the literature and is proba¬ 
bly the most popular one (it is denoted as P(z) in Eq. 
4). This prior is essentially an apparent, observed-frame 
luminosity function of galaxies times unit volume inte¬ 
grated over redshift. We could use a more physically 
oriented prior such as rest-frame luminosity function or 
stellar mass function. After some experiments, it turned 
out that the N(z) prior give similar improvements to lu¬ 
minosity and stellar mass function priors. Because N(z) 
is expected to introduce less bias in the inferred physical 
properties of galaxies, we choose to use N(z). 

Constraining N(z) is not a trivial task because it re¬ 
quires unbiased redshifts of a large number of objects 
down to faint enough magnitudes that we are interested 
in. One of the ways would be to use se mi-analytic mod¬ 
els as done by iBrammer et al.l (120081 ). although semi- 
analytic models may not perfectly reproduc e the N(z) 
in the real universe (jHenriques et all 12012 ). We here 
take an empirical app roach and use v ery accurate photo- 
z estimates f romlllbert et al.1 (120 091 to construct N(z). 
We note that lllbert et al.l ( 2009^ did not apply any N (z) 
prior. 

There are a number of f unctional forms of N(z) 
adop t ed in the literature (e.g. , Benitez 2000 : lllbert et al.1 
120091: ISchrabback et al.l 120101: IHildebrandt et al.l 120121) . 
As discussed bv iLe Fevre et al.l (1201 3p ~ N(z) shows a 
high redshift tail at faint magnitudes. We slightly tweak 
the functional forms adopted by these authors and take 
the form of 


N(z) oc (2+0.1)“ 


exp 



+ /3exp 



(n) 

where zo, a, and /3 are free parameters. This 
form has fewer free par amet ers than those adopt ed in 
ISchrabback et al.l (|2010l) and ILe Fevre et ahl (1 20 131) . but 


z-mag range 

a 

P 

ZQ 

17.0-18.0 

2.76 

0.00 

0.20 

18.0-19.0 

2.89 

0.10 

0.26 

19.0-21.0 

3.06 

0.30 

0.34 

21.0-21.5 

2.32 

0.00 

0.53 

21.5-20.0 

2.30 

0.00 

0.59 

20.0-22.0 

2.26 

0.00 

0.44 

22.0-22.5 

2.16 

0.01 

0.64 

22.5-23.0 

2.07 

0.06 

0.69 

23.0-23.5 

1.72 

0.30 

0.74 

23.0-24.0 

1.82 

0.15 

0.71 

24.0-24.5 

1.32 

0.01 

0.82 

24.5-25.0 

1.33 

0.01 

0.88 


TABLE 1 

Fitted parameters for N{z\m,i). 


the high redshift tail is described well by the 2nd ex¬ 
ponent. We also introduce a ’soften ing’ parameter sug¬ 
gested by IHildebrandt et all (l2012f) to avoid too small 
probabilities at low redshifts, which can be problematic 
for bright objects. 

This prior can be extended t o include magnitude de¬ 
pendence, P(z\m), as done bv iBenfteS (j2000[ ). We fit 
the f unctio nal form in Eq. [TT] to the photo-z catalog 
from lllbert et al.l (120091) in a number of Aband magni¬ 
tude bins. A sample fit is shown in Fig. [Hand the fitted 
parameters are summarized in Table [T] We note that 
one of the most popular codes bpz applies a template- 
dependent N(z) prioi0, but we assume that this prior is 
independent of templates. 



Z 

Fig. 1.— N(z) for galaxies with 23.0 < i < 23.5. The histogram 
with points is the observed N(z) and the curve is a fit to it. 


4.3. P(SFR|M*.z) 


There is a well-known correlation between SFR and 


stellar mass (e.g., 

Daddi et al.l 2007): |Elbaz et al.l 20071; 

iWuvts et all 2011b 

: Whitaker et al. 20121). This relation 


has been extensively studied across redshifts and we use 
it as a priori knowledge to constrain the model templates. 
The relation between SFR and stellar mass is not quite 
linear, but for simplicity, we assume a linear correlation 
between SFR and stellar mass M*; 

5 This template-dependent P(z) prior constrains relative frac¬ 
tions of galaxy populations (e.g., quiescent vs. star forming) as a 
function of redshift. Therefore, a care is needed when it is used for 
galaxy population studies. 





























































6 


Tanaka 


SFR SF (M*,z) = SFR*(z) x M 0 yr” 1 , (12) 

where 


SFR* ( 2 ;) 


10 x (1 + z) 2 ' 1 (z < 2), 
19 x (1 + z) 1 ' 5 (z > 2). 


(13) 


At z = 0, a typical SFR of a star forming galaxy with 
1O 11 M 0 is IOMq yr -1 and SFR at fixed mass increases 
at higher redshifts such that SFR is 10 times larger at 
z = 2. At higher redshifts, there is currently no consen- 
sus ab out the evolution of the star formation sequence. 
iGonzalez et al.1 (l2010i) suggested that the evolution of 
sSFR (which is proportional to SF R at fixe d mass) flat¬ 
tens out at z > 2, while [Stark et all ([2013t l hinted at a 
gradual increase at hig her redsh ift s. We assume a grad¬ 
ual increase shown in lStark et al.1 (|2013f ) with a redshift 
dependence of (1 + z) 1 ' 5 . We have confirmed that there 
is no strong effect on photo-z accuracies if we assume no 
evolution at z > 2 for the photo-z tests performed in 
Section 6. 

While star forming galaxies show a strong correlation 
between SFR and stellar mass, quiescent galaxies are off 
from that relation. We define a sequence for quiescent 
galaxies as the star forming sequence offset by —2 dex. 
SEP fitting do es not reproduce SFRs at very low SFRs 
dPacifici et al.1 l2012lf and we find that this form works 
well after some experiments. SFRs can numerically be 
very small (e.g., 10 -9 M 0 yr -1 ) and SFRs lower than 
—2 dex from the star forming sequence are forced to be 
—2 dex just for computational reasons. 

Putting all this together, we assume that galaxies ex¬ 
hibit the two distinct sequences formed by star form¬ 
ing and quiescent galaxies and a probability of finding a 
galaxy with a star formation rate of SFR with a given 
stellar mass at redshift z can be expressed as a sum of 
two Gaussians: 


P(SFR|M*,z) oc 


1 

-exp 

CSF 


1 / log SFR — log SFR sf (AA, z) 

2 V CSF 


2 ' 


1 

d-exp 


1 / log SFR - log SFRsf(M„ z) + 2 

2 V 


2 ' 


(14) 


sensitive to the choice of the scatters within reasonable 
ranges. Fig. [2] schematically illustrates this prior. 



M* [M q ] 

Fig. 2.— Schematic plot of SFR vs stellar mass prior at 2 : = 2. 
A darker color means a higher probability. Note that this prior is 
redshift dependent and the prior at z = 0 for instance is offset by 
— 1 dex in SFR. 


4.4. P(t v \SFR,z) 

An amount of attenuation is known to p osit ivel y corre¬ 
late with SFR as well as s tellar mass fe.g.. lHopkins et all 
120031 : iGarn fe Bestll2010h . This fact has often been em¬ 
ployed in phot o-z computations in the literature. E.g., 
iFontana et al.l (12004H limited the attenuati on to Ay < 
0.6 for low-SFR galaxies with age It > 4, and lllbert et al.1 
(j2009f ) did not apply extinction to their quiescent tem¬ 
plates. Here, we formulate the relationship and irnple- 
ment it as a pr ior. 

iSobral et al.l (12012H suggested that the relationship be¬ 
tween attenuation and stellar mass seems to remain un¬ 
changed out to z ~ 1. One might use this interesting 
relation as a prior, but the problem is that the relation 
is actually bimodal — at a given stellar mass, there are 
star forming and quiescent populations and only the for¬ 
mer suffers from attenuation. To avoid this bimodal¬ 
ity, we use the redshift-dependent SFR-attenuation rela- 
tioifl We define an evolution corrected SFR as 


where SFRsf(A/*, z) is the mean SFR for a star forming 
galaxy at redshift z and with stellar mass M* defined 
in Equation (1121) . We assume the relative fraction be¬ 
tween the two populations is 1:1 at all redshifts to be 
conservative, ctsf and ctq are the dispersions in each of 
the star forming and quiescent sequence. Here we take 
<r sf = 0.3 dex and ctq = 1 dex. The observ ed scatter of 
the s equence may be smaller (e.g., 0.2 dex; iDaddi et al.l 
l2007|h but sta rbursting galaxies are of ten located off from 
the sequence foodighie ro et all 1201 111 and we choose to 
assume a slightly large scatter. As for the quiescent se¬ 
quence^ lowSFRs are difficult to measure from SEDs 
dPacifici et a 1 .1120121 ) and we adopt a large scatter. But, 
we should emphasize that photo-z accuracies are not very 


SFR 0 = 100 


SFR 

SFR* (z) ’ 


(15) 


where SFR*(z) is defined in Eq. [Inland is used to elimi¬ 
nate the redshift dependence of the SFR-attenuation re¬ 
lation here. A factor of 100 is motivated to set the thresh¬ 
old SFR above which Ty correlates with SFR to unity. 
We adopt a Gaussian form for this prior: 


6 The redshift independence of the mass-attenuation relation 
might appear odd at a first glance given the evolving relation¬ 
ship between SFR and stellar mass discussed above. The evolving 
SFR-mass relation seems to be compensated by the evolving SFR- 
attenuation relation, making the mass-attenuation independent of 
redshift. 
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P(tv |SFR, z) oc exp 


1 f Ty - < Ty > 

2 V °Vv 


2 ' 


where 


(16) 


< Ty > = 


0.2 (SFR 0 < 1), 

0.2 + 0.5logSFR 0 (SFR 0 > 1). 


(17) 


If a galaxy has a low SFR with its normalized SFR less 
than unity, the mean attenuation is fixed to a small value 
of 0.2. This non-zero value is adopted in order to leave 
room to compensate for metallicity variations. But, we 
have confirmed that photo- z accuracies do not signifi¬ 
cantly change if we set it to zero. For the same reason, 
we adopt a relatively large a TV of 0.5. The threshold 
SFR 0 is equiva lent to SF R=O.1M 0 yr -1 at z = 0 and is 
motivated bv lGarn &; Bestl (12010D . At higher SFRs, the 
mean attenuation increases with SFR. The adopted func¬ 
tional form reproduces the observation by IGarn fc Bestl 
(|2010f ). The prior is schematically illustrated in Fig. [3] 
We should note that the attenuation that IGarn fc Bestl 
(1201011 measured is for Ha, which is known to su ffer f rom 
a larg er attenuation than the stellar continuum dCalzettil 
119971 ). The ry parameter in our models defines attenua¬ 
tion for stellar continuum and thus the mean attenuation 
assumed above should have been smaller by about a fac¬ 
tor of 2. However, we find that the shallower relation 
results in poorer photo-z accuracies. We speculate that 
this is due to a combined effect of (a) assumption of solar 
metallicity for all models and (b) dependence of attenu¬ 
ation curve on SFRs. As mentioned above, there is de¬ 
generacy between metallicity, age, and attenuation. We 
fix the metallicity to solar and let age and attenuation 
vary to compensate for the metallicity variation in real 
galaxies, which may affect t h e SFR-atte nuation relation. 
Also, as discussed bv lllbert et all (I2009H . the attenuation 
curve may be dependent on SFR, which also affects the 
relation. This is obviously an area where further work is 
needed. 



SFR 0 [M 0 yr ‘] 

Fig. 3.— Schematic plot of ry vs SFRo prior. SFRo is a function 
of redshift. For reference, SFRo = 1 M© yr -1 corresponds to 
SFR= 0.1 M© yr -1 at z = 0 and SFR= 1 M© yr -1 at z = 2. 


It is known, from experience, that templates with very 
young ages often give poor redshift accuracies and /or in- 
accurate phys i cal pr o perties (e.g., iFontana et al.l 12003: 
Pozzetti et al.1 120071: iTanaka et all 120101 : iWuvts et al l 
2011 all . This is primarily due to degeneracy introduced 
by these young templates; young templates all look simi¬ 
lar regardless of their star formation timescales and they 
look similar even to older templates with long star for¬ 
mation timescales when only a small number of filters 
are available. All this is because the overall spectrum is 
dominated by young stars. Very young templates and/or 
templates with short r are often manually excluded to re¬ 
duce the degeneracy in the literature. Rather than man¬ 
ually removing them from the library, we prefer to put 
it on a more physical ground. 

A crude, time-averaged mean star formation rate of a 
template can be given by 


< SFR > = Mt,/age. (18) 

This is not strictly a mean SFR because the stellar mass- 
loss is not accounted for but is a reasonable proxy. Ex¬ 
cept for starbursting galaxies, steady-state star forming 
galaxies (i.e., those on the star formation sequence) have 
SFRs of up to a few hundred M 0 yr -1 . Those are the 
most massive galaxies whose stellar mass (or equivalently 
SFR here) function follows an exponential form. Moti¬ 
vated by this, we assume a function of 


P(age\M *, z) oc exp 


< SFR > 
SFR*(z) 


(19) 


This form gives a low probability to young templates 
and is effectively similar to the hard-cut priors adopted 
in the previous studies. A notable difference is that this 
form is more effective for more massive galaxies; a mas¬ 
sive galaxy with young age has a low probability, while 
a low-mass galaxy with the same age has a high proba¬ 
bility. This makes sense because low-mass galaxies are 
expected to be younger than massive galaxies. Another 
difference is that the characteristic SFR evolves with red¬ 
shift in an observationally motivated way. This prior is 
schematically shown in Fig. [I] 




redshift 

Fig. 4.— Schematic plot of the < SFR > vs redshift prior. 


4.5. P(age\M*,z) 


4.6. Probability distribution function 

The code produces a redshift PDF and the full PDF 
information should be used for any science analysis. But, 
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it may be useful to make a point estimate for each object 
for certain purposes. A PDF of a galaxy can be quite 
complex and one faces a question of how to make such a 
point estimate. The most popular ways include: (1) peak 
of the PDF, (2) weighted mean, and (3) the median of 
the PDF. The definition of the first one is obvious. The 
latter two are defined as 

/ z max 

zP(z)dz, (20) 

and 

fZmedian 

/ P(z)dz = 0.5, (21) 

J Zmin 

where z rnln and z max are 0 and 7 in our case. After some 
experiments, we find that the median redshift works best. 
One might expect that the weighted redshift is a reason¬ 
able method because it uses the full PDF. But, a PDF 
often has multiple peaks and, depending on the relative 
probabilities of the peaks, the weighted redshift often lies 
in between the peaks. One can imagine a simple case in 
which there are two peaks with the same probability in 
a PDF, and the weighted redshift will be exactly in the 
middle of the peaks, where there is little probability. 

The peak redshift is often adopted in the literature, but 
the median redshift works better. It is likely because the 
median redshift is a better tracer of the ’primary’ peak in 
PDF. Peaks in a PDF have different heights and widths 
and the peak with the largest integrated probability, not 
the peak probability, is more likely to be a correct red¬ 
shift. This primary peak could be better identified by 
integrating the probability as done in the median red¬ 
shift, not by locating the highest peak. One can explore 
an algorithm to directly identify the primary peak rather 
than taking the median of PDF, but for computational 
simplicity, we use the median redsliifts in this paper. 

5. TEMPLATE ERROR FUNCTION 

We use model templates generated with an SPS code. 
These model templates are subject to systematic un¬ 
certainties. Such uncertainties include deviation of star 
formation histories of real galaxies from the simple r- 
model, ignorance of metallicity distribution, and errors 
in the stellar evolutionary track (which introduces an 
age-dependent systematics), etc. These systematic un¬ 
certainties cause mismatches between model SEDs and 
observed SEDs of galaxies. We use a template error func¬ 
tion to crudely correct for such system atics. The tem¬ 
plate e rror function is first introduced bv lBrammer et, ahl 
( 2008 ) and it is used to assign random uncertainties to 
templates. But, we extend it to include correction for 
systematic flux errors. 

We define the template error function with two terms; 
systematic flux stretch and random flux uncertainty. The 
first term is a flux correction to model templates as a 
function of rest-frame wavelength and is meant to re¬ 
duce mismatches with observed SEDs. This correction 
is typically small, of order a few percent, but can be as 
large as 10 per cent at some wavelengths. The second 
term assigns random uncertainties to the model fluxes 
to weight reliable parts of the SEDs, which is reasonable 


because model templates are often calibrated in the rest- 
frame optical and they can be more uncertain at other 
wavelengths. 

A template error function can be easily coupled with 
zero-point offsets in photometric data in real analysis. In 
order to separate these two components, we first correct 
for systematic zero-point offsets by fitting spectroscopic 
objects located at low redshifts with their redsliifts fixed 
to their spectroscopic redsh ifts. Given that the Sloan 
Digital Sky Survey (lYork et ah 200 0) provides a large 
number of galaxies at 2 < 0.2 over a wide area of the 
sky, we use objects located at z < 0.2 and take the me¬ 
dian of fobs/fmodel to estimate the zero-point offsets in a 
given filter. This process normally converges (i.e., offsets 
become less than 1%) after a few iterations. Note that 
no template error functions are applied in this process. 

We then construct a template error function. For this, 
we need a large sample of spectroscopic redshifts over a 
wide range of redshift. This is fairly demanding and there 
are only a few deep fields that allow us to make error 
functions usi ng spec-z’s (e.g ., Chandra Deep Field South 
as done in lTanaka et all12013at h A practical alternative 
would be to use very accurate photo-Us computed with 
many bands, which can be as accurate as a(Az/(l + 
z)) ~ 0.01, for this job. We use the ugriz photometry 
a nd t he precise photometric redshift from lllbert et al.l 
(12009H to construct template error functions. We do not 
include medium-bands nor narrow-bands here because 
we are interested in flux errors smoothed over a typical 
bandwidth of broad-band filters. Narrow/Medium-band 
filters will give flux stretches on a finer wavelength scale. 
In other words, if one would like to apply a template 
error function to medium-band filters, that function has 
to be estimated separately from broad-bands. 

We compute the systematic differences between the 
observed fluxes and the best-fit model fluxes using ob¬ 
jects with precise photometric redshifts. Because objects 
spread over a range of redshift, we can effectively cover a 
wide range of rest-frame wavelength with the broad-band 
photometry alone. The median of fobs/fmodel at a given 
rest-frame wavelength is used to correct for the system¬ 
atic template errors (1st term) and dispersion around the 
median is used as an uncertainty of the model templates 
at that rest-frame wavelength (2nd term). A typical pho¬ 
tometric uncertainty in the observed data is subtracted 
off from the measured dispersion in order to derive intrin¬ 
sic uncertainties in the models. In principle, one could 
apply such a correction to templates of a given spectral 
type. But, for now we simply make several redshift bins 
and apply a single “master” correction to all the tem¬ 
plates in each bin in this paper. 

Fig. [5] shows the template error functions in several 
redshift bins generated with the COSMOS data. The 
corrections are typically a few percent, but there are a 
few specific rest-frame wavelength ranges where t h e cor - 
rections can be as large as 10%. iMaraston et al.l (12009T ) 
showed that the observed g — r color of passively evolv¬ 
ing galaxies at z ~ 0.4 is bluer by about 0.1 mag com¬ 
pared to models with an empirical stellar spectral library. 
Although a direct comparison with our template error 
functions cannot be made because we include star form¬ 
ing galaxies as well here, it is consistent that models are 
too red around the rest-frame 4000A by about 10%. It 
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is interesting to note that the template error functions 
evolve with redshift; the correction at ~ 3500A increases 
towards z ~ 1. This has an implication for an age- 
dependent systematics in the SPS models, but a detailed 
study of it is beyond the scope of this paper. A template 
error function may provide an interesting way to address 
issues with SPS models using photometric data. 



2 - i i * $ $ 

0.0<z<0.3 0.3<z<0.6 0.6<z<1.0 1.0<z<2.0 2.0<z<7.0 

1000 2000 5000 10 4 

rest—frame wavelength [A] 

Fig. 5.— Template error functions in several redshift bins. The 
points show flux stretches and the error bars show intrinsic un¬ 
certainties in the model fluxes. The symbols are explained in the 
figure. 


6. IMPROVEMENTS IN PHOTOMETRIC 
REDSHIFTS 

We move on to demonstrate how the physical priors 
and template error functions improve photo-Vs. We 
first describe the input photometric data and then de¬ 
fine quantities to characterize photo-Vs. After that, we 
present our photo-Vs and discuss some of the outstand¬ 
ing issues. 

A photo-z accuracy is dependent on available filters 
and we focus on griz and ugriz photometry in the rest 
of the paper because these are the filter sets often avail¬ 
able in large surveys; the HSC survey and DES observe 
in grizy , CFHTLenS (which will be discussed in the next 
section) in ugriz , and LSST in ugrizy. The y-band pho¬ 
tometry is not available in the public COSMOS data and 
thus the photo-z accuracy we discuss below is a lower 
limit of these surveys. Another reason for this limited 
filter set is because priors play an important role. Many 
filters over a wide wavelength range provide a strong con¬ 
straining power on SED shap es and redsh ifts and priors 
become less important (e.g., Illbert et~aLl 120091 did not 
apply any priors because they had the 30-band photom¬ 
etry). Priors are more effective when less photometric 
information is available and the ( u)griz photometry is a 
good starting point to investigate how effective the phys¬ 
ical priors introduced in Section 4 are. 


l2003lf . and zCOSMOS dLillv et all l2007h . The deep¬ 
est of these is VVDS-UltraDeep, which is a flux lim¬ 
ited survey at 23.0 < i < 24.75. It is a very unique 
source of absorption line redshifts of high redshift galax¬ 
ies, which are mostly missed from shallower surveys. Un¬ 
fortunately, however, even this deepest survey is still not 
deep enough to calibrate and validate photo-Vs in the on¬ 
going/upcoming surveys, which reach i ~ 26 or deeper. 
There is currently no spectroscopic sample that allows 
us to calibrate photo-Vs down to this faint magnitude. 
Imaging goes deeper than spectroscopy. 

A practical calibration ladder to reach faint magni¬ 
tudes would be to use many-band photo-Vs often avail¬ 
able in deep fields. One nota ble example of t his is photo- 
Vs available in COSMOS (|Ilbert et al.ll2009D . and in this 
section, we compare our griz photo-Vs with the 30-band 
photo-Vs down to i = 25.0. This is not as deep as 
we would like, but the public catalog does not contain 
fainter galaxies. Also, the COSMOS data are proba¬ 
bly not deep enough to obtain good photo-Vs at fainter 
magnitudes. The catalog is i-selected and is suited to 
test our code in the context of large surveys, in which 
objects are often detected in the i-band for weak-lensing 
science. There are more recent near-IR selected catalogs 
(|Ilbert et alJl2013t iMuzzin et al.ll2013l) . but most of the 
above mentioned large surveys are optical surveys and 
we stick with the i-selected catalog. We note that the ef¬ 
fects of the physical priors and template error functions 
demonstrated below are not dependent on the detection 
filter. We show in the Appendix that they indeed signif¬ 
icantly improve photo-Vs for a A'-selected catalog. 

We add 0.02 mag uncertaint y in the quadrature to 
the cataloged uncertainties in lllbert et al.1 (l2009t l because 
the uncertainties seem to be underestimated (reduced 
X 2 are often large for bright objects). This affects only 
bright galaxies for which uncertaint ies are sm all er t han 
~ 0.02 mag. We also note that Illbert et~abl (j2009i l in¬ 
creased the uncertainties by a factor of 1.5 (which affects 
all galaxies) for their photo-z analysis. 

6.2. Quantities to characterize photo-z 

There are a few standard quantities used to character¬ 
ize photo-z accuracies; bias, dispersion, and outlier rate. 
The definitions adopted in the literature are not always 
the same and we explicitly define them here for this work. 

• Bias: Photo-z’s may systematically be off from 
spectroscopic redshifts and we call this system¬ 
atic offset bias. We compute a systematic bias in 
(2phot - Zs n fl r)/(1 + -Zsn er) by applying the biweight 
statistics (IBeers et al.lll990H . We iteratively apply 
3 <t clipping for 3 times to reduce outliers. 

• Dispersion: In the literature, dispersion is often 
computed as 


6.1. Input photometric catalog 

Obvious sources to characterize the accuracy of photo- 
z’s are spectroscopic redshifts. There are a number of 
flux-limited/color-selected spectroscopic surveys in the 
literatu re. Some of th e largest ones would i nclud e 
SDSS (lYork et al l 12001 GA MA (IDriver et, all [Mill 
WP S (ILe Fevre et all12005ft . V IPERS (IGuzzo et a! 


I20l3lf . PRIMUS (I Coil et.alJl20 ill) . DEEP2 ([Davis et al 


u conv = 1-48 x MAD ( Zp ^ Zspec ) , (22) 

where MAD is the median absolute deviation. Note 
that this definition does not account for the sys¬ 
tematic bias. In addition to this conventional defi¬ 
nition, we also measure the dispersion by account¬ 
ing for the bias using the biweight statistics. We 
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case 

physical priors 

N(z) prior 

template errfn 

bias 

<J 

Joutlier 

Oconv 

foutlier,conv 

i 

No 

No 

No 

+0.011 

0.143 

30.9% 

0.182 

45.2% 

2 

Yes 

No 

No 

-0.017 

0.101 

24.2% 

0.112 

30.2% 

3 

Yes 

Yes 

No 

-0.027 

0.094 

19.7% 

0.100 

24.7% 

4 

Yes 

Yes 

Yes (w/o offset) 

-0.019 

0.082 

20.6% 

0.087 

22.2% 

5 

Yes 

Yes 

Yes (w/ offset) 

-0.001 

0.084 

20.8% 

0.081 

22.9% 

6 

No 

Yes 

Yes (w/ offset) 

+0.004 

0.101 

19.1% 

0.095 

25.8% 


TABLE 2 

Summary of photo-z accuracies. 


iteratively apply a 3er clipping as done for bias to 
measure the dispersion around the central value. 
We denote the conventional dispersion and the bi¬ 
weight dispersion as cr C onv and cr, respectively. 

• Outlier rate: The conventional definition is 


N 


/outlier,conv — 


( I-phot —spec 
1 + ^spec 


> 0.15 


N< 


total 


(23) 


where outliers are defined as |z p hot — z spe c|/(l + 
2 spec ) > 0.15. Again, this definition does not ac¬ 
count for the systematic bias. The threshold of 0.15 
is an arbitrary value but is probably fine for photo¬ 
n’s with several bands. It is clearly too large for 
those with many bands. Together with this con¬ 
ventional one, we also define outliers as those 2<j 
away from the central value (these a and center are 
from biweight; see above). This 2cr is an arbitrary 
choice, but it is motivated to match reasonably well 
with the conventional one for several band photo¬ 
n’s. We will denote the er-based outlier fraction as 
/outlier and the conventional one as /outlier,conv 


6.3. Photo-z vs. spec-z 

Let us now demonstrate how the priors and template 
error function improve photo-n. Table [2] summarizes the 
combination of the priors and template error functions 
and the resultant photo-n accuracies discussed in this 
section. We shall emphasize that some of the many-band 
photo-n’s that we compare with may be incorrect and 
thus we should not over-interpret the numbers discussed 
below in the absolute sense. 

We start with the top-left panel of Fig. [6] (case 1), in 
which photo-n’s computed without any of the physical 
priors and template error functions are compared with 
the high-accuracy photo-n’s. As indicated in the plot, 
the scatter and outlier rate are both very high and the 
photo-n’s are not very useful for any science applications. 
The mean bias is small (+0.01), but this is because a neg¬ 
ative bias at n < 1 is compensated by a positive bias at 
higher redshift by chance. The biggest failure mode is 
that a large fraction of low-n galaxies up-scatter to high- 
n. This is because the griz photometry probes redwards 
of the 4000A break of these galaxies, where there is no 
prominent spectral feature. The blue continuum of low-n 
star forming galaxies in the optical can be confused with 
blue UV continuum of galaxies at high redshifts, resulting 
in the up-scattered population. The overall poor photo-n 
accuracy is the primary reason why SPS templates are 


not usually used for photo-n’s and the two-step proce¬ 
dure with different templates is often adopted to esti¬ 
mate, e.g., stellar mass. We focus on a limited number 
of filters here, but SPS templates still give poorer ac¬ 
curacy than empirical tem pla tes ev en when more filters 
are used. E.g.. iDalrlen et all (2 01311 carried out photo-n 
tests with 14 filters for a number of codes and the typ¬ 
ical performance of SPS-based codes is worse than that 
of empirical codes. 

If we apply the physical priors (case 2 in Table [2] and 
Fig. O, both the scatter and outlier rate significantly de¬ 
crease. In particular, our primary failure mode of the up- 
scattered low-n galaxies is suppressed. This is an impor¬ 
tant result and it proves the concept of our technique; the 
physical priors are indeed useful for improving photo-n’s. 
Public photo-n codes use a fixed set of SED templates re¬ 
gardless of redshift. The physical priors employed here 
effectively evolve the templates with redshift and the ob¬ 
served improvements suggest that the template evolution 
is actually important for photo-n’s. We have not used in¬ 
formation like sizes and morphologies here and there is 
clearly room for further improvements. 

We then apply the standard N(z) prior (case 3). As 
expected, this prior is efficient in removing low-n galaxies 
scattered out to high redshifts due to the degeneracy in 
the multi-color space discussed above. Both the overall 
scatter and outlier rate are reduced by the prior. But, 
the backside of it is that it is also effective in reducing 
correct photo-n’s at n > 3. This is because those high 
redshift galaxies are rare and N(z) gives correspondingly 
small probabilities at high redshift. A preliminary analy¬ 
sis shows that this can be remedied by applying stronger 
physical priors with a less strong N(z) prior (e.g., one 
can introduce a ’floor’ in N(z) such that the minimum 
probability at a given redshift is some small fraction of 
the peak probability). This is also an area where further 
work is needed to achieve good photo-n’s across the en¬ 
tire redshift range. We also should note that our N(z) 
prior is constructed from the lllbert et~akl (1200911 catalog 
and we apply it to the same catalog here. We will need 
to extensively test the prior on a different catalog (see 
next section for comparisons with CFHTLenS). 

In addition to the priors, we apply the template er¬ 
ror functions (case 4). The outlier rate (/ outlier) slightly 
increases, but this is due to the reduced a (recall that 
outliers are defined as > 2cr). In fact, / outlier,conv, which 
is based on the fixed outlier definition, is reduced. The 
overall accuracy is improved by the template error func¬ 
tions and this means that the systematic mismatches be¬ 
tween the SPS templates and observed SEDs are in fact 
an issue in photo -2 and they have to be corrected. The 
implication here is not only for SPS models but for ob- 
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Fig. 6.— z phot plotted against z 30band . The solid line is z phot = z 30band and the dashed lines show \z phot -z 30band \/(l+z 30band ) = ±0.15. 
Objects above/below these lines are regarded as outliers in the conventional definition. The plots are for 6 cases defined in Tablc[2] Objects 
with i < 25 are included and every 30 objects are plotted for clarity. No object clipping is applied in the statistics here. 
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served SED templates as well; a care must be taken when 
one applies observed SED templates collected at z = 0 
to high redshift galaxies because there will be systematic 
mismatches in the SEDs. 

One may notice that there is a horizontal feature at 
Zphot ~ 0.5 both at low-z and high-z. Only faint {i > 24) 
galaxies contribute to this feature. These faint galaxies 
have a wide and relatively flat PDF at 0 < z < 0.8, due 
to the large photometric uncertainties and to the limited 
filter set. For such PDF, the median redshift tends to 
cluster around z p hot ~ 0.5. At higher redshift, more than 
2 filters fall below the 4000A break and we can achieve a 
reasonable accuracy until we get to z ~ 3, where the g- 
band starts to fall below the Lyman break and again the 
photometric uncertainty becomes very large. As shown 
in the next section, the additional u-band largely removes 
this feature. 

Our overall photo-z accuracy may n ot a p pear as g ood 
as those achieved in, e.g., DES (iSanchez et alJ 12014 ). 
But, there are important differences in the depths, the 
re dshift range consi dered, and analysis method used. 
ISanchez et al.l (l2014f) studied galaxies with i < 24 at 
z < 1.5 and clipped 10% of the objects with large uncer¬ 
tainties, while we look at those with i < 25 out to z ~ 5 
without any clipping. We will make a fair comparison 
with an external code in Section 7. Note that, for weak- 
lensing applications, the photo-z’s in Fig. [G] are clearly 
not good enough and we do need to apply a clipping 
techni que such as the one developed bv lNishizawa et all 
( 2010 1 in order to reduce outliers. 

6.4. Bias 

There is an issue with our photo-z case 4, which is 
that the photo-z’s are biased low, bias=—0.019. This 
is illustrated in Fig. [3 in which the bias is plotted as 
a function of i-band magnitude. This bias is a seri¬ 
ous problem for weak-lensing cosmology, which requires 
mean r e dshift to be more accurate than 1 percent (e.g., 
iShirasaki fc Yoshidall2014l f. 

Interesting ly, t his systematic o ffset has already been 
observed by IDahlen et al.l (1201311 . who analyzed deep 
HST data with several photo-z codes. As shown in their 
Fig. 4, it is quite striking that most of the codes (even 
for the same code with different setups) show the same 
systematic offset of ~ —0 .01. In terest ingly, the photo- 
z comparisons bv ISanchez et all (12014 1 seem to suggest 
that machine-learning techniques do not show such an 
offset. It may be that a negative bias is a common prob¬ 
lem in the SED-based photo-z techniques. We find that 
the amount of the offset is dependent on the available 
data; if we use all the 30-band photo metry in COSMOS, 
the bias is essentially zero (in fact, lllbert et al.ll2009l ob¬ 
served no bias), but the bias increases with decreasing 
the number of filters. 

The negative bias has been observed but its origin has 
not been discussed in the literature. We have carefully 
looked into this issue, but we have not identified the root 
cause of the bias yet. We find that the N(z) prior intro¬ 
duces a negative bias, but it does not give a full account 
of the observed bias. We also find that the physical priors 
do not introduce a strong bias (see case 6 discussed be¬ 
low) . The fact that public photo-z codes that do not use 
physical priors also show a negative bias supports this 


finding. Just for now, we employ an ad-hoc correction to 
reduce the bias. Because we know we tend to underesti¬ 
mate redshifts by 0.02, we construct the template error 
functions with redshifts fixed to z spec 0.02/(1 + z spec ). 
The error functions constructed at offset redshifts reduce 
the bias as shown in Table [2j Figs. [6] and 0 (case 5), al¬ 
though they slightly increase the outlier rate. The bias 
is not completely gone as shown in Fig. [3 and further 
calibrations may be needed. We emphasize again that 
the negative bias is not a specific problem to our code, 
but a common problem. 

Finally, as the last proof of the effects of the physical 
priors, we show that photo-z’s still improve even when 
they are applied after all the other corrections. Case 6 
in Fig. [6] and Table [2] shows photo-z's computed without 
the physical priors. A comparison with case 5 shows that 
the physical priors are indeed effective in reducing the 
outliers and dispersion even when they are added at the 
end. Based on all the above results, we conclude that the 
physical priors improve photo-z’s. As a further illustra¬ 
tion of the physical priors, we show how photo-z’s change 
if we change the prior parameters in the Appendix. 
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Fig. 7.— Bias as a function of i-band magnitude. The symbols 
are for different cases as shown in the figure. Case 2, 3 and 5 are 
omitted for the sake of clarity. The dashed line is bias = 0 and the 
dotted lines are bias = =bl%. 
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6.5. Accuracy of P(z) 

An important quality control of photo-z is to assess 
the accuracy of PDF. It is difficult to estimate how accu¬ 
rate individual PDFs are, but the PDF accuracy can be 
addressed statistically. We compute fractions of objects 
that have photo-z’s consistent with the 30-band photo- 
z’s within 68 and 95 percentile intervals. If the PDFs 
are perfect, the fractions should be 68 and 95%. Devia¬ 
tions from these numbers are a manifestation of incorrect 
PDF. 

In case 5, we find that 61% and 92% of objects have 
consistent photo-z’s within 68 and 95 percentile inter¬ 
vals, respectively. It appears that our uncertain t y esti- 
rnates are slightly underestimated. IDahlen et al.l (120131) 
observed a similar trend for a number of codes (see their 
Table 5). This seems to be another common problem in 
SED-based photo-z codes. Catastrophic outliers prob¬ 
ably contribute to the problem here and the reduction 
of those outliers will help. A careful construction of the 
template error functions will be useful too (recall that 
these functions include template uncertainties and thus 
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are able to control photo -2 uncertainties). But, we do 
not perform further calibrations at this point because 
P(z) is strongly dependent on the accuracy of photomet¬ 
ric uncertainties, which is not al ways strai ghtf orwar d to 
precisely measure. The fact that lllbert et al.l (120091 1 in¬ 
creased the flux uncertainty in the catalog by a factor of 
1.5 does not seem to imply that the uncertainties are pre¬ 
cisely measured. We also find that reduced \ 2 of bright 
objects is often too large if we do not add a 0.02 mag 
uncertainty in the quadrature. We defer further work on 
P{z) to our future paper. 

6.6. Brief summary 

We have demonstrated that the physical priors and 
template error functions improve photo- 2 ’s. There are 
still issues left (e.g., understanding the root cause of the 
negative bias) and there is still room for improvements 
(e.g., priors on size and morphology), but the concept 
of the physical priors is proven to be useful. This is an 
important result. We are now motivated to compare our 
code with an external code to characterize how well our 
code works in the following sections. 


7. COMPARISON WITH CFHTLENS 

In this section, we compare our code with an external, 
publicly available code. We do not run an external code 
by ourselves in order to avoid any human biases (e.g., one 
may not fully calibrate an external code to make his/her 
own code work better). In stead, we use those from 
CFHTLenS (|Hevman s et al.l 120120 . wh ich have been 
carefully computed by Hildc brandt et al.1 (12012 ) with one 
of the most popular public codes BPZ (lBenftedl20Q0H . to 
make a fair comparison. 


7.1. Data 

The photometric redshift in CFHTLenS is based on 
data from the CFHT Legacy Survey. The ugriz pho¬ 
tometric data are obtained with MegaCam under good 
seeing conditions. We have retriev ed the photo metri c 
data from the CFHTLenS database (lErben et al.ll2013l) . 
CFHTLenS does not overlap with COSMOS and we here 
use spectroscopic redshifts from VVDS, not high-quality 
photo-z’s as done in the previous section. We focus on 
two VVDS layers: VVDS-Deep and VVDS-UltraDeep, 
which are flux -limited to i = 24 a nd to 23 < i < 24.75, 
respectively dLe Fevre et al.l(20T3T) . The surveys are per¬ 
formed with VIMOS on VLT with different instrumen¬ 
tal setups, resulting in different redshift coverages. We 
discuss them separately. We select secure spectroscopic 
redshifts with flags 3 and 4 from these surveys and then 
cross-match them with the photometric objects from 
CFHTLenS within 1 arcsec. In total, we have 3,522 and 
450 objects for VVDS-Deep and UltraDeep, respectively. 

7.2. Comparison with BPZ 

Let us start with VVDS-Deep shown in Fig. [8] The 
two codes perform similarly well; MIZUKI shows a higher 
outlier rate but with a slightly smaller scatter. The fail¬ 
ure mode of MIZUKI is that a fraction of low -2 objects 
up-scatter to high- 2 . Such failures are fewer in BPZ. An 
interesting point here is that BPZ has a small negative 
bias of —0.016. This negative bias again might be a com¬ 
mon issue for SED fitting (see discussion in the previous 


section). On the other hand, MiZUKi’s bias is only +0.003 
thanks to the template error function. 

Turning to VVDS-UltraDeep in Fig. [9l there are ob¬ 
jects all the way out to 2 ~ 3.5. Although the instru¬ 
mental redshift selection function is probably not com¬ 
pletely uniform, VVDS-UltraDeep is less biased com¬ 
pared to VVDS-Deep thanks to the very long integration 
time and a wider wavelength coverage. We find that BPZ 
tends to give very low z p hot to galaxies at 2 < z spec < 3, 
which likely comes from the confusion between the Ly¬ 
man break and the 4000A break. It might be that the 
N(z) priors adopted in the code strongly prefer the low -2 
solution, resulting in the down-scattered population. On 
the other hand, MIZUKI does not show such a strong fail¬ 
ure and the numbers of outliers at low -2 and high -2 are 
about the same, suggesting the priors are about right. 

In summary, mizuki is at least as good as BPZ and pos¬ 
sibly better at faint magnitudes and at high redshifts. 
The physical priors work well and the template error 
functions keep the bias smaller than that of BPZ. While 
further calibrations may be needed, MIZUKI is already a 
powerful tool to infer photometric redshifts. 

8. BIASES IN PHYSICAL PROPERTIES OF 
GALAXIES 

Physical properties and redshifts are very often in¬ 
ferred using different templates in the literature; one first 
uses empirical templates, PCA-based templates, or nu¬ 
merical techniques to compute redshift because they de¬ 
liver better redshifts than SPS templates. Then, one 
switches to SPS templates to infer physical properties at 
the best-fit photo- 2 . This is not a self-consistent way 
to infer them because different template sets may give 
different P(z) and the spectral shape and redshift are 
partially degenerate. Our code delivers good photo -2 
accuracy and we can compute physical properties in a 
self-consistent manner. As of this writing, the code com¬ 
putes P(M*), P(SFR ), P(ry) marginalized over all the 
other parameters. 

We use the template error functions to remove small 
systematics in the SPS templates. Also, we apply the pri¬ 
ors on physical properties. They may affect the inferred 
properties. In this section, we quantify biases in the in¬ 
ferred properties and show that they are well within the 
systematics of the SPS models that are inherent in any 
physical property estimates based on the SED fitting. In 
general, the recovery accuracy of physical properties is 
dependent on redshift accuracy, filter set (and depths), 
and details of the SED fitting technique. We have demon¬ 
strated in the previous sections that the template error 
functions and physical priors improve photo- 2 . Also, in¬ 
tensive work on the recovery accuracy fo r a number of 
filter combination is already done by iPforr et all ( 2012 ). 
For these reasons, we fix redshifts and assume only 2 rep¬ 
resentative filter sets in order to make useful comparisons 
focused on the fitting technique. 

In the following, we make two comparisons: internal 
comparison and external comparison. The first one is to 
use our code with and without the template error func¬ 
tions and physical priors to quantify differences in the 
inferred properties. The 2nd one is to compare with a 
public SED fitting code with identical assumptions in the 
SPS models. As mentioned earlier, one should use the 
full PDFs of the physical properties for science analyses, 
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Fig. 8. — plotted against z spec from VVDS-Deep. The panels are for BPZ (left) and mizuki (right). The solid and dashed lines 

show Zphot = Zspec and ( z p hot - z sp ec)/(l + Zspec) ± 0.15, respectively. 
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Fig. 9.— As in Fig. |8] but for VVDS-UltraDeep. 

but it is nevertheless useful to make point estimates. We 
take the median of PDF as done for redshift (see Eq. 21). 

8.1. Data 

For the purpose of the section, we need a pub¬ 
lic multi-wavelength catalog with physical property es¬ 
timates. NEWFIRM m edium band survey (NMBS; 
Ivan Dokkum et all [2009D is a good resource for this. 
NMBS achieved one of the most accurate photometric 
redshifts to date thanks to the large number of broad¬ 
band and medium-band photometry. This also allows 
reliable measurements of physical properties of galaxies. 
NMBS presents two catalogs: one in COSMOS and the 
other in AEGIS. We use the COSMOS catalog here be¬ 


2 

spec 


cause it includes medium-ba nd data in the optica l and 
is better than the other one ([Whitaker et alJI201lD . We 
take the griz photometry and photo- 2 ’s from the catalog 
for the analysis in the following section. 

8.2. Internal comparison 

First, we make an internal comparison. We compute 
stellar mass, SFR, and ry with and without the tem¬ 
plate error functions and the physical priors. The fits 
are performed at the NMBS photo-z’s. Fig. [TU] shows 
the differences in the inferred physical properties as a 
function of redshift. If only the priors are applied, there 
is only mild redshift dependence with a weak bias up to 
~ 20%. The template error function introduces stronger 
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redshift dependence, giving rise to ~ 30% bias at some 
redshifts. The bias for the priors and error functions 
combined is similar to that of the error function only 
case, which means that the overall bias is primarily due to 
the template error function. We know that SPS models 
do not perfectly reproduce the observed SEDs of g alax¬ 
ies, e ven for passively evolving galaxies (Marasto n et al.l 
2009). Most authors nevertheless fit SPS models and use 
the one that is most similar to an observed SED to in¬ 
fer its physical properties. The ~ 30% bias observed in 
Fig. [10] can be regarded as a level of the systematic un¬ 
certainty in this process. Fortunately, it is smaller than 
the overall systematic un certa inty inher ent in the phys¬ 
ical properties; e.g.. iConrov et all (|2009H suggested that 
stellar mass measured at z = 0 carries an uncertainty of 
about a factor of 2. While the bias introduced by the 
physical priors and template error functions should be 
kept in mind, it is unlikely to dominate the overall error 
budget. 


8.3. External comparison 

Next, we compare our code with fast dKriek et alJ 
l2009ll . FAST is a x 2 code to infer physical properties 
of galaxies by fitting observe d SED s of galaxies with 
SPS models. iWhitaker et alJ (1201 111 applied the code 
to the NMBS data under identical assumptions to ours: 
iBruzual &; Chariot! (l2003h SPS models with exponen¬ 
tially decaying star formation histor y, C habrier IMF, 
Calzetti attenuation curve dCalzetti et al.ll2000fl , and so¬ 
lar metallicity. fast is run with all the filters in the 
catalog, and a fair code-code comparison can be made if 
we use all the filters. But, we choose to primarily dis¬ 
cuss the griz photometry in what follows in order to keep 
consistency with the previous sections and also to obtain 
a crude sense of how accurately we can infer the physi¬ 
cal properties using optical data from large surveys. We 
summarize all of our results in Fig. [Til and Table [3] We 
discuss each physical property in turn. 

Stellar mass: On average, our stellar mass is about a 
factor of 2 higher than fast. This difference may not be 
too surprising because we use only 4 filters, while FAST 
used more than 35 filters over a wider wavelength range. 
At z > 1, all of the griz filters fall below the 4000A break 
and we lose an ability to constrain the stellar mass to lu¬ 
minosity ratio, resulting in the large bias and scatter at 
high redshifts. As shown in Table [3] the overall offset 
reduces to < 0.1 dex when we use all the available filters. 
Thus, the limited photometry, not the code or templates, 
is likely a primary cause of the offset. The typical statis¬ 
tical uncertainty in our stellar mass is 0.2-0.3 dex, even 
though we fix the redshift in the fits. 

We note in p assing that similar redshi ft dependence 
was observed by Ivan Dokkum et ah] (12014 ) in their com¬ 
parison between the UltraVISTA and 3D-HST data. 
They used a large number of filters to infer stellar mass 
using FAST, but they still observed a difference in stellar 
mass between the two data sets with a somewhat strong 
redshift dependence of —0.3^: dex. This seems to imply 
that, not only codes and templates, but data matter a 
lot. 

Dust attenuation: We find that our estimates are 
consistent with FAST, although the dispersion is large 
(j{tv) ~ 0.3. As discussed in lPacifici et al.l (|2012f) . we ex- 
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Fig. 10.— Ratio or difference between physical parameters in¬ 
ferred with and without the template error functions and physical 
priors (legend shown in the plots). The points and the error bars 
show the median and quartiles of the distribution. 


pect a large uncertainty in the inferred amount of atten¬ 
uation from the broad-band photometry and we should 
first note that the attenuation inferred by fast is not 
the truth table here. With this caveat in mind, it is in¬ 
teresting that we do not observe a significant systematic 
difference at any redshifts. This is somewhat surprising 
because our photometry probe only the UV wavelengths 
at high redshifts. It is likely because the amount of at- 
tenuatio n is controlled well by the physical prior. 

SFR: [Whitaker et al.l (1201 if ) computed SFRs from the 
sum of the UV and IR luminosities under the reasonable 
assumption that absorbed UV photons are re-radiated 
thermally in the IR. The bottom-left panel of Fig. [TT| 
compares SFRuv+ir with SFRs from fast based on the 
SED fitting. Assuming UV + IR is a reasonable tracer 
of SFR, we find that FAST tends to underestimate SFRs. 
At z > 1, SFRs from fast are lower by an order of 
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Fig. 11.— Top plots show physical properties inferred with our code against those with FAST. The top-left is for stellar mass and the 
top-right is for attenuation. The bottom plots show correlations between different SFR estimators. The bottom-left panel shows SFRfast 
against SFRuv+ir and the bottom-right panel is SFRmizuki against SFRuv+ir- The small panel in the bottom of each plot shows the 
ratio or difference between the vertical and horizontal axes as a function of redshift. 
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magnitude. In contrast, mizuki estimates SFRs reason¬ 
ably well as shown in the bottom-right panel. SFRs are 
slightly underestimated, but the overall agreement with 
SFRuv+ir is good and the dispersion (which is computed 
by clipping outliers as done in Section 6) is relatively 
small, a factor of < 2. Interestingly, the offset increases 
if we include all the filters out to IR wavelengths (Table 
0. One of the reasons for this would be our templates 
do not include thermal emission from warm dust yet. 

Overall, physical properties inferred by MIZUKI agrees 
reasonably well with FAST. There is a redshift-dependent 
offset in stellar mass, but it is likely due to the limited 
photometry. Amount of attenuation agrees well, and for 
SFR, MIZUKI works better than fast. Together with the 
good photo-z accuracy demonstrated in the previous sec¬ 
tion, MIZUKI is a powerful tool to infer physical properties 
in a fully self-consistent manner. 

9. ROOM FOR IMPROVEMENTS 

Throughout the paper, we have identified areas where 
further work is needed and there is clearly a large room 
for improvements in our technique. One of the ways for¬ 
ward would be to explore new priors. We have assumed 
solar metallicity in the SSP models, but given the re¬ 
cent substantial progress in the field, we could introduce 
a mass-metallicity prior. The size and morphology in¬ 
formation Jms^ no Vbeen fully exploited in the literature 
(iWrav fc Giumll2008l mav be a notable exception) and the 
size-mass and morphology-mass (or SFR) priors would 
be interestin g to exp l ore. Surface brightness might be 
useful, too (iStabenau et al.ll2008{ l. 

We also need to improve our templates. We have lim¬ 
ited ourselves to optical data in this paper, but multi¬ 
wavelength data sets are often available in deep fields. 
One urgent improvement would be to include thermal 
emission from warm dust. Because we know the amount 
of attenuation and SFR for each template, we can add 
a model of thermal emission to the stellar SEP base d 
on the energy conservation (e.g.. Ida Cunha et al.1 20081. 
In addition to galaxies, we are also interested in stars 
and QSOs. We are making progress in QSO and stellar 
models as well as priors for them and we defer detailed 
discussions on them to our future paper. 

Finally, speed. The current speed of the code is about 
2 objects per second, which is too slow to be applied to 
survey data. The code can be optimized further and the 
same is true for templates; we find that 20-30% of the 
templates generated in Section 3 never give good fits to 
the observed SEDs of galaxies. We can safely remove 
those templates from the library, which will make the 
code faster. We aim to address these issues in our future 
work. 

10. SUMMARY 

We have presented a proof-of-concept analysis of pho¬ 
tometric redshifts with Bayesian priors on physical prop¬ 
erties of galaxies such as SFR, stellar mass and dust at¬ 
tenuation. The priors are not fully optimized yet, but 


we have shown that the priors improve photometric red- 
shifts significantly. Furthermore, template error func¬ 
tions, which are intended to correct for systematic flux 
errors as well as to assign random uncertainties to model 
templates, also improve photometric redshifts. We have 
compared our code with BPZ, which is one of the most 
popular photo-z codes, and shown that MIZUKI performs 
similarly well at bright magnitudes and it works better 
at faint magnitudes and at high redshifts. 

One unique feature of the technique is that we can si¬ 
multaneously infer redshifts and physical properties of 
galaxies in a fully self-consistent manner, unlike the two- 
step procedure with different templates often adopted in 
the literature. We have compared physical properties in¬ 
ferred by MIZUKI with those from FAST under identical 
assumptions and confirmed that the inferred properties 
agree well, except for SFR, for which mizuki works sig¬ 
nificantly better. The priors and template error func¬ 
tions inevitably introduce a bias in the inferred physi¬ 
cal properties of galaxies, but we have shown that it is 
small, < 30%. The bias is primarily due to mismatches 
between SPS SEDs and observed SEDs of galaxies and 
hence it is a problem com mon to all codes that use the 
iBruzual fc Chariot ( 200 31 model to infer physical prop¬ 
erties. 

Overall, mizuki is a powerful code to infer both red¬ 
shifts and physical properties of galaxies. We have iden¬ 
tified a number of areas where further work is needed 
throughout the paper. We will improve the code using 
data from the on-going large surveys such as the HSC 
survey. Once the code becomes mature enough, we will 
make the code available to the public. 
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parameter offset ( griz ) dispersion ( griz ) 

M*,mizuki/M*,fast +0.33 dex 0.23 dex 

tv,mizuki/tv,fast +0.02 0.29 

SFRfast/SFRuv+ir 

SFR M izuki/SFRuv+ir —0.06 dex 0.26 dex 


offset (all) 
+0.09 dex 
-0.16 


—0.66 dex 
—0.34 dex 


dispersion (all) 
0.12 dex 
0.25 

0.64 dex 
0.25 dex 


TABLE 3 

Consistency between mizuki and fast. 



0 12 3 4 



0 12 3 4 


Z NUBS Z NUBS 



Fig. 12. — Photometric redshifts based on the griz photometry for the K-selected NMBS catalog. 

APPENDIX 

EFFECTS OF PRIORS AND TEMPLATE ERROR FUNCTIONS ON A A'-SELECTED CATALOG 

The physical priors and template error functions introduced in the paper do not in principle depend on which filter is 
used for object detections, but it is still important to explicitly show that they work for a catalog that is not ^-selected. 
For this, we use the NMBS catalog used in Section 8, in which the objects are detected in the Af-band. Fig. Q2] 
shows photo-z’s for case 1, 2, and 7 using the griz photometry. In case 7, only the physical priors and template error 
functions with offsets are applied (i.e., no N{z) prior). We do not show the cases that involve the N(z) prior because 
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Fig. 13. — Photometric redshifts computed using the ’wrong’ physical priors for case 2 and 5. 

the N(z) prior is for the i-band and thus it by construction does not work well for a /i-selected sample. The statistics 
are summarized in Table 0] 

A comparison between case 1 and 2 clearly demonstrate that the priors work very well for the iF-selected catalog; the 
up-scattered population at low-z is strongly suppressed and the overall scatter is reduced. In addition, the template 
error functions applied in case 7 further improve the accuracy. In particular, the bias is significantly reduced and there 
is no remaining systematic offset. Based on these results, we conclude that the physical priors and template error 
functions are effective regardless of the detection filter. 


case 

physical priors 

N(z) prior 

template errfn 

bias 

a 

J outlier 

& conv 

f outlier, conv 

i 

No 

No 

No 

-0.025 

0.090 

27.2% 

0.107 

31.3% 

2 

Yes 

No 

No 

-0.032 

0.063 

20.5% 

0.076 

18.1% 

7 

Yes 

No 

Yes (w/ offset) 

+0.000 

0.066 

18.5% 

0.064 

15.8% 


TABLE 4 

Summary of photo-z accuracies for the R-selected NMBS catalog using griz photometry. 


EFFECTS OF APPLYING ’WRONG’ PHYSICAL PRIORS 

Our physical priors are motivated by observations, but it would be instructive to show what happens if we change 
the prior parameters. As an illustrative example, we change SFR*(z) defined in Eq. [I3]by —1 dex. What it effectively 
does is to shift the observed sequence of star forming galaxies downwards by A SFR = — 1 dex in the SFR vs stellar 
mass prior defined in Section 4.3. It also changes the attenuation prior in Section 4.4 in the sense that the prior has 
larger attenuation than what is observed. 

We run our code with these ’wrong’ priors on the same COSMOS catalog as in Section 6. Here, we keep the N(z) 
prior and template error functions unchanged. The resultant photo-Ys for case 2 and 5 are shown in Fig. 1131 Statistics 
for all the cases are summarized in Table [5] Case 1 and 6 do not use the physical priors, but their accuracies are 
reproduced from Tabled for easy comparisons. It is clear that the wrong priors degrade photo-Ys. The priors give 
underestimated photo-Ys at z < 0.5 and overestimated ones at z ~ 1. This degradation is expected because the priors 
give a large probability to objects that do not really exist in the universe. In fact, by comparing with case 5 and 6 in 
Table [5j it is better not to apply the wrong physical priors. The improvement delivered by our fiducial physical priors 
shown in Section 6.3 in turn suggests that thes fiducial priors are not far from optimal. 
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